Learning Speaker Representation with Semi-supervised Learning Approach for Speaker Profiling
A semi-supervised framework for speaker profiling that leverages external unlabelled corpora via supervised, unsupervised, and consistency training, achieving RMSE of 6.8 years on age estimation.